Figure 1:

Query= INSP087 

(1357 letters) 

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF

1,039,285 sequences; 328,747,273 total letters



Searching. 



.done 



Sequences producing significant alignments: 



Score E 
(bits) Value 



ref|XP_090334.2| (XM_090334) similar to ovostatin precursor - ch...
ref|XP_132895.1| (XM_132895) similar to ovomacroglobulin, ovosta...
emb|CAA55385.1| (X78801) ovomacroglobulin, ovostatin [Gallus gal...
sp|P20740|OVOS_CHICK OVOSTATIN PRECURSOR (OVOMACROGLOBULIN) >gi|...
ref|NP_000005.1| (NM_000014) alpha 2 macroglobulin precursor [Ho...
ref|NP_036620.1| (NM_012488) alpha-2-macroglobulin [Rattus norve...
prf||1009174A macroglobulin alpha2 [Homo sapiens]
ref|NP_002855.1| (NM_002864) pregnancy-zone protein; Pregnancy z...
pir||JC5143 alpha-macroglobulin precursor - guinea pig >gi|13040...
ref|NP_665722.1| (NM_145779) pregnancy-zone protein [Rattus norv...
Note: The top two matches, XP_090334.2 and XP_132895.1, were predicted by
automated computational analysis using a gene prediction method.
Figure 2:

>emb|CAA55385.1| (X78801) ovomacroglobulin, ovostatin [Gallus gallus]
Length = 1454

Score = 1170 bits (3027), Expect = 0.0

Identities = 615/1369 (44%), Positives = 896/1369 (64%), Gaps = 31/1369 (2%)



Query: 1 VPQARSDPLAFITFSAKGATLNLEERRSVAIRSRENVVFVQTDKPTYKPGQKVHILTLFL 60

+P S LAFI+F+AKG T +L+ERRSV I + E+ VFVQTDKP YKPGQ V + L
Sbjct: 84 IPPVTSVSLAFISFTAKGTTFDLKERRSVMIWNMESFVFVQTDKPIYKPGQSVMFRVVAL 143

Query: 61 -FLFQ-----YPVITLQDPQNNRIFQRQNVTSFRNITQLSFQLISEPMFGDYWIVVKRNS 114

F F+ YP+I +QDPQNNRIFQ QNVTS NI Q+ F L EP+ G+Y I+V + S

Sbjct: 144 DFNFKPVQEMYPLIAVQDPQNNRIFQWQNVTSEINIVQIEFPLTEEPILGNYKIIVTKKS 203

Query: 115 RETVTHQFAVKRYVLPKFEVTVNAPQTVTISDDEFQVDVCAKYNFGQPVQGETQIRVCRE 174 

E +H F V+ YVLPKF+VTV AP +-I-T+ D E V +CA. Y +GQPV+G+ Q+ VCR+ 
Sbjct: 204 GERTSHSFLVEEYVLPKFDVTVTAPGSLTVMDSELTVKICAVYTYGQPVEGKVQLSVCRD 263

Query: 175 YFSSSNCEKNENEICEQFIAQLE-NGCVSQIVNTKVFQLYRSGLFMTFHVAVIVTESGTV 233 

+ S C+K+ +C+ F L+ +GC+S I+++KVF+L R G V IVTE V 

Sbjct: 264 FDSYGRCKKSP — VCQSFTKDLDTDGCLSHILSSKVFELNRIGYKRNLDVKAIVTEKEQV 321 

Query: 234 MQISEKTSVFITQLLGTVNFENMDTFYRRGISYFGTLKFSDPNNVPMVNKLLQLELNDEF 293 

++ S+ ITQ++ +-f FEN+D YRRGI YFG +K D +N P+ NK++QL +N++ 
Sbjct: 322 CNLTATQSISITQVMSSLQFENVDHHYRRGIPYFGQIKLVDKDNSPISNKVIQLFVNNKN 381 

Query: 294 IGNYTTDENGEAQFSIDTSDIFDPEFNLKATYVRPESCYLPSWLTPQYLDAHFLVSRFYS 353 

N+TTD NG A FSIDTS IFDPE +LKA Y + C+ W+ P Y DA V R YS 
Sbjct: 382 THNFTTDINGIAPFSIDTSKIFDPELSLKALYKTSDQCHSEGWIEPSYPDASLSVQRLYS 441

Query: 354 RTNSFLKIVPEPKQLECNQQKVVTVHYSLNSEAYEDDSNV1CFFYLMMVKGAILLSGQKEI 413 

T+SF++I P K + C Q++++TV+Y LN+E YE + V F+Y+ M KG I+L+G+ ++ 
Sbjct: 442 WTSSFVRIEPLWKDMSCGQKRMITVYYILNTEGYEHINIWFYYVGMAKGKIVLTGEIBCV 501 

Query: 414 RNKA-WNGNFS FPISISADLAPAAVLFVYTLHPSGEIVADSVRFQVDKCFKHKVNIKFSN 472 

+A NG F P+ ++ +APA L VY LHP+ E+VADSVRF ++KCFK+KV ++FS 
Sbjct: 502 NIQADQNGTFMIPLWNEKMAPALRLLVYMLHPAKELVADSVRFSIEKCFKNKVQLQFSE 561 

Query: 473 EQGLPGSNASLCLQAAPVLFCALRAVDRNVLLLKSEQQLSAESVYNMVPSIEPYGYFYHG 532 

+Q L SN SL ++AA FCA+RAVD+++LLLKSE +LSAE++YN+ P + GY ++G 
Sbjct : 562 KQMLTTSNVSLVIEAAANSFCAVRAVDKSMLLLKSETELSAETIYNLHPIQDLQGYIFNG 621 
Query: 533 LNLDDGKEDPCIPQRDMFYNGLYYTPVSNYGDGDIYNIVRNMGLKVFTNLHYRKPEVCVM 592

LNL+D +DPC+ D+F+ GLYY P+++ D+Y +R+MG+K FTN R+P VC 
Sbjct: 622 LNLEDDPQDPCVSSDDIFHKGLYYRPLTSGLGPDVYQFLRDMGMKFFTNSKIRQPTVCTR 681 

Query: 593 ERRLPLPKPLYLETENYGPMRSVPSRIACRGENADYVEQAIIQTVRTNFPETWMWDLVSV 652 

E P P Y + + +++ + ++ I++T+R FPETW+WD++ + 

Sbjct: 682 ETVRP PSYFLNAGF-TASTHHVKLSAEVAREERGKRHILETIREFFPETWIWDIILI 737 

Query: 653 DSSGSANLSFLIPDTITQWEASGFCVNGDVGFGISSTTTLEVSQPFFIEIASPFSWQNE 712 

+S+G A++S+ IPDTIT+W+AS FCV GFG+S TL QPFF+++ P+S++ E 
Sbjct: 738 NSTGKASVSYTIPDTITEWKASAFCVEELAGFGMSVPATLTAFQPFFVDLTLPYSIIHGE 797 

Query: 713 QFDLIVNVFSYRNTCVEISVQVEESQNYEANIHTLKINGSEVIQAGGRKTNVWTIIPKKL 772 

F + NVF+Y N C++I+V + ES +Y+A + + + +G + A RK+ VW I PK 
Sbjct: 798 DFLVRANVFNYLNHCIKINVLLLESLDYQAKLISPEDDG--CVCAKIRKSYVWNIFPKGT 855

Query: 773 GKVNITVVAESKQSSACPNEGMEQQKLNWKDTVVQSFLVEPEGIEKERTQSFLICTEGAK 832

G V ++ AE+ AC E + +++++DT +++ LVEPEGI +E TQ+FLIC + 
Sbjct: 856 GDVLFSITAETNDDEACEEEALRNIRIDYRDTQIRALLVEPEGIRREETQNFLICMKDDV 915 

Query: 833 ASKQGVLDLPNDVVEGSARGFFTVVGDILGLALQNL-VVLQMPYGSGEQNAALLASDTYV 891

S+ +DLP +VVEGS R F+VVGDI+G A+QN+ +LQMP+G+GEQN L A + YV
Sbjct: 916 ISQDVAIDLPTNVVEGSPRPSFSVVGDIMGTAIQNVHQLLQMPFGNGEQNMVLFAPNIYV 975

Query: 892 LDYLKSTEQLTEEVQSKAFFLLSNGYQRQLSFKNSDGSYSVFWQQSQKGS--ICALTFKT 949

LDYL T QL+E+V+SK L +GYQ+QLS+K+ DGSYS F + ++G+ + A +K+
Sbjct: 976 LDYLDKTRQLSEDVKSKTIGYLVSGYQKQLSYKHPDGSYSTFGIRDKEGNTWLTAFVYKS 1035

Query: 950 LERMKKYVFI DENVQKQTLIWLSSQQKTSGCFKNDGQLFNHAWEGGDEEDISLTAYWGM 1009 

++++ID+NVQ QTLIWL+++QKT GCF++ G L N+A +GG E ++SL+AY+ 
Sbjct: 1036 FAEASRFIYIDDNVQAQTLIWLATKQKTDGCFQSTGILVNNAMKGGVENELSLSAYITIA 1095 

Query: 1010 FFEAGLNFTFPALRNALFCLEAALDSGVTNGYNHAILAYAFALAGKEKQVESLLQTLDQS 1069 

EAG + + +RNA +CLE A + +T+ Y A++AYAF LAGK + ES L+ L +S 
Sbjct: 1096 LLEAGHSMSHTVIRNAFYCLETASEKNITDIYTQALVAYAFCLAGKAEICESFLRELQKS 1155

Query: 1070 APKLNNVIYWERERKPKTEEFPSFIPWAPSAQTEKSCYVLLAVI SRKIPDLTYASKI 1126 

A +++ YWE+ ++ E+ + S E + YVLLA++ +R DLT AS I 
Sbjct: 1156 AKEVDGSKYWEQNQRSAPEK-SHLLDHVQSTDVEITSYVLLALLYKPNRSQEDLTKASAI 1214 

Query: 1127 VQWLAQRMNSHGGFSSNQETAVCLLAITRYITQGLFSKDQNTVTFSSEGSSE-IFQVNGH 1185 

VQW+ ++ NS+GGF+S Q+T V L A+ Y S QN + +S+ + E +F VN 

Sbjct : 1215 VQWIIRQQNSYGGFASMQDTWALQALAAYGAATYNSVTQNVIKINSKNTFEKVFTVNNE 1274 



WO 2004/035626 



10/531751 

PCT/GB2003/004500 



4/24 



Query: 1186 NRLLVQRSEVTQAPGEYTVDVEGHGCTFIQATLKYNVLLPKKASGFSLSLEIVKNYSST- 1244 

NRLL+Q++ + Q PG+Y++ V G GC IQ L+YN+ LP+ A GFSLS++ N S 
Sbjct: 1275 NRLLLQQTPLPQVPGKYSLTVNGTGCVLIQTALRYNIHLPEGAFGFSLSVQ-TSNASCPR 1333 

Query: 1245 AFDLTVTLKYTGIRNKSSMVVIDVKMLSGFTPTMSSIEELENKGQVMKTEVKNDHV 1300

FD+ + YTG R+ S+MV+IDVKMLSGF P SS+++L + VM+ E K +HV
Sbjct: 1334 DQPGKFDIVLISSYTGKRSSSNMVIIDVKMLSGFVPVKSSLDQLIDDHTVMQVEYKKNHV 1393

Query: 1301 LFYLENGF-GRADSFPFSVEQSNLVFNIQPAPAMVYDYYEKEEYALAFY 1348 

L YL N R FSVEQ +V + +PAP +YDYYE EEYA+A Y 

Sbjct: 1394 LLYLGN I LQKRRKEVT FS VEQDFWTHPKPAPVQI YDYYETEE YAVAE Y 1442 
Figure 3:

Query= INSP088_pep 

(894 letters) 

Database: All non-redundant GenBank CDS 
translations+PDB+SwissProt+PIR+PRF 

1,039,285 sequences; 328,747,273 total letters 



                                                                    Score    E
Sequences producing significant alignments:                         (bits)  Value

dbj|BAC04793.1| (AK096448) unnamed protein product [Homo sapiens]     974    0.0
ref|NP_000005.1| (NM_000014) alpha 2 macroglobulin precursor [Ho...   770    0.0
prf||1009174A macroglobulin alpha2 [Homo sapiens]                     765    0.0
ref|NP_036620.1| (NM_012488) alpha-2-macroglobulin [Rattus norve...   754    0.0
pir||JC5143 alpha-macroglobulin precursor - guinea pig >gi|13040...   736    0.0
ref|NP_075591.1| (NM_023103) alpha (1)-inhibitor 3, variant I [Ra...  734    0.0
sp|P14046|A1I3_RAT ALPHA-1-INHIBITOR III PRECURSOR >gi|91945|pir...   723    0.0
ref|NP_002855.1| (NM_002864) pregnancy-zone protein; Pregnancy z...   719    0.0
pir||JC5144 murinoglobulin precursor - guinea pig                     718    0.0
dbj|BAA12317.1| (D84339) murinoglobulin [Cavia porcellus]             715    0.0
Figure 4:

>ref|NP_000005.1| (NM_000014) alpha 2 macroglobulin precursor [Homo sapiens]
sp|P01023|A2MG_HUMAN Alpha-2-macroglobulin precursor (Alpha-2-M)
pir||MAHU alpha-2-macroglobulin precursor - human

gb|AAA51551.1| (M11313) alpha-2-macroglobulin precursor [Homo sapiens]
Length = 1474

Score = 770 bits (1989), Expect = 0.0

Identities = 417/917 (45%), Positives = 580/917 (62%), Gaps = 40/917 (4%)



Query: 1 QVSLGFSPSQQLPGAEVELQLQAAPGSLCALRAVDESVLLLRPDRELSNRSVYGMFPFWY 60 

+V L FSPSQ LP + L++ AAP S+CALRAVD+SVLL++PD ELS SVY + P 
Sbjct: 567 KVDLSFSPSQSLPASHAHLRVTAAPQSVCALRAVDQSVLLMKPDAELSASSVYNLLP 623 

Query: 61 GHYPYQVAEYDQCPVSGPWDFPQPLIDPMPQGHSSQRSIIWRP-SFSEGTDLFSFFRDVG 119

ED GP+Q D+++ I+PS+ D++SF D+G 

Sbjct: 624 EKDLTGFPGPLN- DQDDEDCINRHNVYINGIT YT PVS STNEKDMYS FLE DMG 674 

Query: 120 LKILSNAKIKKPVDCSH RSPE YSTAMGAGGGHPEAFESSTPLHQAEDSQ 168 

LK +N+KI+KP C PE + + G GH P 
Sbjct: 675 LBCAFTNSKIRKPKMCPQLQQYEMHGPEGLRVGFYESDVMGRGHARLVHVEEP HTET 730 

Query: 169 VRQYFPETWLWDLFPIGNSGKEAVHVTVPDAITEWKAMSFCTSQSRGFGLSPTVGLTAFK 228 

VR+YFPETW+WDL + ++G V VTVPD ITEWKA +FC S+ G G+S T L AF+ 
Sbjct: 731 VRKYFPETWIWDLVWNSAGVAEVGVTVPDTITEWKAGAFCLSEDAGLGISSTASLRAFQ 790 

Query: 229 PFFVDLTLPYSWRGESFRLTATIFNYLKDCIRVQTDLAKSHEYQLESWADSQTSSCLCA 288 

PFFV+LT+PYSV+RGE+F L AT+ NYL CIRV L S + Q C+CA 

Sbjct: 791 PFFVELTMPYSVIRGEAFTLKATVLNYLPKCIRVSVQLEASPAFLAVPVEKEQAPHCICA 850 

Query: 289 DDAKTHHWNITAVKLGHINFTISTKILDSNEPCGGQKGFVPQKGRSDTLIKPVLVKPEGV 348 

+ +T W +T LG++NFT+S + L+S E CG + VP+ GR DT+IKP+LV+PEG+ 
Sbjct: 851 NGRQTVSWAVTPKSLGNVNFTVSAEALESQELCGTEVPSVPEHGRKDTVIKPLLVEPEGL 910 

Query: 349 LVEKTHSSLLCPKGKVASESVSLELPVDIVPDSTKAYVTVLGDIMGTALQNLDGLVQMPS 408 

E T +SLLCP G SE +SL+LP ++V +S +A V+VLGDI+G+A+QN L+QMP 
Sbjct: 911 EKETTFNSLLCPSGGEVSEELSLKLPPNWEESARASVSVLGDILGSAMQNTQNLLQMPY 970 

Query: 409 GCGEQNMVLFAPIIYVLQYLEKAGLLTEEIRSRAVGFLEIGYQKELMYKHSNGSYSAFGE 468 

GCGEQNMVLFAP IYVL YL + LT E++S+A+G+L GYQ++L YKH +GSYS FGE 
Sbjct: 971 GCGEQNMVLFAPNIYVLDYLNETQQLTPEVKSKAIGYLNTGYQRQLNYKHYDGSYSTFGE 1030

Query: 469 RDG--NGNTWLTAFVTKCFGQAQKFIFIDPKNIQDALKWMAGNQLPSGCYANVGNLLHTA 526

R G GNTWLTAFV K F QA+ +IFID +1 AL W++ Q +GC+ + G+LL+ A 
Sbjct: 1031 RYGRNQGNTWLTAFVLKTFAQARAYIFIDEAHITQALIWLSQRQKDNGCFRSSGSLLNNA 1090 

Query: 527 MKGGVDDEVSLTAYVTAALLEMGKDVDDPMVSQGLRCLKNSATST TNLYTQALL 580 

+KGGV+ DEV+L+AY+T ALLE+ V P+V L CL+++ + +++YT+ALL 
Sbjct: 1091 IKGG VE DE VT L S AYI T I ALLE I PLT VTHP WRN AL FCLE S AWKT AQEG DHG S H VYTKALL 1150 

Query: 581 AYIFSLAGEMDIRNILLKQLDQQAIISGESIYWSQKPTPSSNASPWSEPAA — VDVELTA 638 

AY F+LAG D R +LK L+++A+ S++W + P + + EP A +VE+T+ 
Sbjct: 1151 AYAFALAGNQDKRKEVLKSLNEEAVKKDNSVHWERPQKPKAPVGHFYE PQAPSAEVEMT S 1210 

Query: 639 YALLAQLT-KPSLTQKEIAKATSIVAWLAKQHNAYGGFSSTQDTVVALQALAKY-ATTAY 696

Y LLA LT +P+ T +++ AT-flV W+ KQ NA GGFSSTQDTWAL AL+KY A T 
Sbjct: 1211 YVLLAYLTAQPAPTSEDLTSATNIVKWITKQQNAQGGFSSTQDTWALHALSKYGAATFT 1270 

Query: 697 MPSEEINLWKSTENFQRTFNIQSVNRLVFQQDTLPNVPGMYTLEASGQGCVYVQTVLRY 756 

+ + ++S+ F F + + NRL+ QQ +LP +PG Y+++ +G+GCVY+QT L+Y 
Sbjct: 1271 RTGKAAQVTIQSSGTFSSKFQVDNNNRLLLQQVSLPELPGEYSMKVTGEGCVYLQTSLKY 1330 

Query: 757 NILPPTNMKTFSLSVEIGKARCEQPTSPRSLTLTIHTSWGSRSSSNMAIVEVKMLSGFS 816 

NILP F+L V+ C++P + S +++ SY GSRS+SNMAIV+VKM+SGF 

Sbjct: 1331 NILPEKEEFPFALGVQTLPQTCDEPKAHTSFQISLSVSYTGSRSASNMAIVDVKMVSGFI 1390 

Query: 817 PMEGTNXXXXXXPLVKKVEFGTDTLNIYLDELIKNTQTYTFTISQSVLVTNLKPATIKVY 87 6 

P++ T V + E ++ + IYLD++ T + FT+ Q V V +LKPA +KVY 

Sbjct: 1391 PLKPTVKMLERSNHVSRTEVSSNHVLIYLDKVSNQTLSLFFTVLQDVPVRDLKPAIVKVY 1450 



Query: 877 DYYLPDEQATIQYS DPC 893 

DYY DE A +Y+ PC 
Sbjct: 14 51 DYYETDEFAIAEYNAPC 14 67 
Figure 5:

1 gttcctcagg ccagatctga cccactggca tttattacat tttctgctaa aggagccact

vpq ars dpla fit fsa kgat

61 ctcaacctgg aagagaggag atctgtggca atcagatcca gagagaatgt ggtcttcgta
lnl eer rsva irs ren vvfv

121 cagactgata aacccaccta caagcctgga cagaaagttc atatattaac attattttta
qtd kpt ykpg qkv hil tlfl

181 tttttatttc agtatccagt gatcaccctt caggatcctc aaaacaatcg gatttttcaa 
flf qyp vitl qdp qnn rifq 

241 aggcaaaatg tgacttcttt ccgaaatatt acccaactct cgttccaact gatttcagaa 
rqn vts frni tql sfq lise 

301 ccaatgtttg gagattactg gattgttgtg aaaagaaact caagggagac agtgacacac 
pmf gdy wivv krn sre tvth 

361 caatttgctg ttaaaagata tgtgctgccc aagtttgaag ttacagtcaa tgcaccacaa 
qfa vkr yvlp kfe vtv napq 

421 acagtaacta tttcagatga tgaattccaa gtggatgtat gtgctaagta caactttggc 
tvt isd defq vdv cak ynfg 

481 caacctgtgc aaggggaaac ccaaatccgg gtgtgcagag agtatttttc ttcaagcaat 
qpv qge tqir vcr eyf sssn 

541 tgtgagaaaa atgaaaatga aatatgtgag caatttattg cacagttgga aaatggttgt 
cek nen eice qfi aql engc 

601 gtttctcaaa ttgtaaatac aaaagtcttc caactctacc gttcgggatt gttcatgaca
vsq ivn tkvf qly rsg lfmt

661 tttcatgtcg ctgtaattgt tacagaatct gggacagtta tgcagatcag cgagaagacc 
fhv avi vtes gtv mqi sekt 

721 tcagttttta tcactcaatt gcttggaact gtaaactttg agaacatgga tacattctat 
svf itq llgt vnf enm dtfy 

781 agaagaggga tttcttattt tggaactctt aaattttcgg atcccaataa tgtacctatg 
rrg isy fgtl kfs dpn nvpm 

841 gtgaacaagt tgttgcaact ggagctcaat gatgaattta taggaaatta cactacggat 
vnk llq leln def ign yttd 

901 gagaatggcg aagctcaatt ttccattgac acttcagaca tatttgatcc agagttcaac 
eng eaq fsid tsd ifd pefn 

961 ctaaaagcca catatgttcg acctgagagc tgctatcttc ccagctggtt gacgcctcag
lka tyv rpes cyl psw ltpq

1021 tacttggatg ctcacttctt agtctcacgc ttttactccc gaaccaacag cttcctgaag
yld ahf lvsr fys rtn sflk

1081 attgttccag aaccaaagca gcttgaatgt aatcaacaga aggttgttac tgtgcattac
ivp epk qlec nqq kvv tvhy

1141 tccctaaaca gtgaagcata tgaggatgat tccaatgtaa agttcttcta tttgatgatg
sln sea yedd snv kff ylmm

1201 gtaaaaggag ctatcttact cagtggacaa aaggaaatca gaaacaaagc ctggaatgga 
vkg ail lsgq kei rnk awng 

1261 aacttctcgt tcccaatcag catcagtgct gatctggctc ctgcagccgt cctgtttgtc
nfs fpi sisa dla paa vlfv

1321 tatacccttc accccagtgg ggaaattgtg gctgacagtg tcagattcca ggttgacaag
ytl hps geiv ads vrf qvdk

1381 tgctttaaac acaaggttaa cataaagttc tctaacgagc agggcttacc tggttccaat
cfk hkv nikf sne qgl pgsn

1441 gctagtctct gtcttcaagc ggcgcctgtc ttattctgtg ccctcagggc tgtggatagg
asl clq aapv lfc alr avdr

1501 aatgtccttc tactgaaatc tgaacaacag ctgtcagctg aaagtgtgta taacatggtt
nvl llk seqq lsa esv ynmv

1561 ccaagtatag agccgtatgg ttatttctac catggcctca atcttgatga tggcaaggaa 
psi epy gyfy hgl nld dgke 

1621 gacccttgca ttcctcagag ggatatgttc tacaatggtt tatattacac acctgtaagc 
dpc ipq rdmf yng lyy tpvs 

1681 aactatgggg atggagatat ctataatatt gtcaggaaca tgggtctaaa agtctttacc 
nyg dgd iyni vrn mgl kvft 

1741 aatctccatt accgaaaacc agaagtatgt gtgatggaga gaaggctgcc actccctaag
nlh yrk pevc vme rrl plpk

1801 ccgctttatc tggaaacaga aaattatggt ccaatgcgta gtgttccgtc tagaattgca
ply let enyg pmr svp sria

1861 tgtagagggg agaatgctga ctatgtagaa caggctataa ttcaaacagt aagaacaaac
crg ena dyve qai iqt vrtn

1921 ttcccagaga catggatgtg ggacctcgtc agtgtcgatt cctcaggctc tgccaatctt
fpe twm wdlv svd ssg sanl

1981 tcgttcctca ttcctgatac gataacccaa tgggaggcaa gtggcttttg tgtgaatggt
sfl ipd titq wea sgf cvng

2041 gacgttggat ttggcatttc ctctacaacc actctagaag tctcccaacc tttctttatt
dvg fgi sstt tle vsq pffi

2101 gagattgcct cacccttttc ggttgttcaa aatgaacaat ttgatttgat tgtcaatgtc
eia spf svvq neq fdl ivnv

2161 ttcagctacc ggaatacatg tgtagagatt tctgttcaag tggaggagtc tcagaattat
fsy rnt cvei svq vee sqny

2221 gaagcaaata ttcatacctt gaaaatcaat ggcagtgagg ttattcaagc tggagggagg
ean iht lkin gse viq aggr

2281 aaaacaaacg tctggactat tatacctaag aaattgggta aagtgaatat cactgtagtt
ktn vwt iipk klg kvn itvv

2341 gctgagtcca aacaaagcag tgcttgccca aatgaaggaa tggagcagca aaagctaaac
aes kqs sacp neg meq qkln

2401 tggaaagaca ctgtggtcca aagcttctta gtagagcctg aaggtattga aaaggaaagg 
wkd tvv qsfl vep egi eker 

2461 acccagagtt tccttatctg tacagaaggt gccaaagcct ccaagcaggg agttttggac 
tqs fli cteg aka skq gvld 

2521 ttgccaaacg atgtagtaga agggtcagcc agaggctttt tcactgttgt gggggatatt
lpn dvv egsa rgf ftv vgdi

2581 ctaggacttg ccttgcagaa tctggttgtt ctccaaatgc cctatggaag tggagagcag
lgl alq nlvv lqm pyg sgeq

2641 aatgctgccc tactagcatc tgatacttat gttctggact atctgaaatc tactgagcaa
naa lla sdty vld ylk steq

2701 ctgacagagg aagttcaatc taaggctttc tttctcttat ctaatggtta tcaaaggcaa
lte evq skaf fll sng yqrq

2761 ttatctttca aaaactctga tggttcctat agtgtgtttt ggcagcagag tcagaaagga
lsf kns dgsy svf wqq sqkg

2821 agcatatgtg ctcttacttt taagacattg gagagaatga aaaaatatgt attcattgat 
sic alt fktl erm kky vfid 

2881 gaaaatgttc aaaaacagac cttaatctgg ctttcaagcc aacagaaaac aagcggctgc 
env qkq tliw lss qqk tsgc 

2941 tttaagaatg atggccagct tttcaaccac gcctgggagg gtggagatga agaggacatt 
fkn dgq lfnh awe ggd eedi 

3001 tcactcactg cgtatgttgt tgggatgttc tttgaagctg ggctcaattt cacttttcct
slt ayv vgmf fea gln ftfp

3061 gctctacgaa acgcactctt ttgccttgaa gcggcattgg acagtggtgt cactaatggc
alr nal fcle aal dsg vtng

3121 tataatcatg caattctagc ttatgctttt gccttagctg gaaaagagaa gcaagtggaa
ynh ail ayaf ala gke kqve

3181 tctttactcc aaaccctgga tcaatctgcc ccaaaactaa ataatgtcat ctactgggaa
sll qtl dqsa pkl nnv iywe

3241 agagaaagga aacccaagac agaagaattt ccatccttta ttccctgggc accttctgct
rer kpk teef psf ipw apsa

3301 cagactgaga agagttgcta cgtgctgttg gctgtcattt cccggaaaat tcctgacctc 
qte ksc yvll avi srk ipdl 

3361 acctatgcta gtaagattgt gcagtggctt gcccaacgga tgaattccca tggaggcttt 
tya ski vqwl aqr mns hggf 

3421 tcttccaacc aggaaactgc agtttgtctt cttgccataa cccgctacat aacccagggg 
ssn qet avcl lai try itqg 



3481 ctcttctcta aggatcaaaa cactgtcacc tttagcagtg aaggatccag tgagattttc 
lfs kdq ntvt fss egs seif 
3541 caggttaacg gtcataaccg cctactggtc caacgttcag aagtaacaca ggcacctgga
qvn ghn rllv qrs evt qapg

3601 gaatacacag tagatgtgga aggacacggt tgtacattta tccaggccac ccttaagtac
eyt vdv eghg ctf iqa tlky

3661 aatgttctcc tacctaagaa ggcatctgga ttttctcttt ccttggaaat agtaaagaac
nvl lpk kasg fsl sle ivkn

3721 tactcttcga ctgcttttga cctcacagtg accctcaaat acactggaat tcgcaataaa
yss taf dltv tlk ytg irnk

3781
vvi dvkm lsg ftp tmss

3841

3901

3961
snl vf iqpa pam vyd yyek

4021

4081 gcaattactg gaagaggtga agaaatttta ttacgtcata aaccattgaa aacacatcta
4141 gtaagaaaat gaaaacctga ataagatagg acagtagttg aagaaagaaa agtgtctggt
4201 acttcattag actt

= INSP087 predicted receptor binding domain.
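
The lowercase amino-acid line under each nucleotide row of Figure 5 reads as a frame-1 translation of that row. The following is a minimal Python sketch of that relationship, assuming the standard genetic code applies; the helper name `translate` and the constant names are illustrative and do not come from the source. It reproduces the translation of the first row.

```python
from itertools import product

# Standard genetic code with codons enumerated in TCAG order.
# Assumption: the figure uses the standard code (its own first row is
# consistent with this).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(nucleotides: str) -> str:
    """Translate a DNA string in frame 1; whitespace and case are ignored."""
    seq = "".join(nucleotides.split()).upper()
    usable = len(seq) - len(seq) % 3  # ignore a trailing partial codon
    # Unknown codons (for example OCR-damaged bases) are reported as "X".
    return "".join(
        CODON_TABLE.get(seq[i:i + 3], "X") for i in range(0, usable, 3)
    )


# First row of Figure 5 (positions 1-60), copied from the figure.
ROW_1 = "gttcctcagg ccagatctga cccactggca tttattacat tttctgctaa aggagccact"
print(translate(ROW_1))  # VPQARSDPLAFITFSAKGAT
```

Running the same helper over any other intact row is a quick way to spot OCR damage: a mismatch between the computed and printed amino-acid line points at a misread base or residue.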
Figure 6

a)

INSP087rbd ATGGTGGTTATAGATGTAAAAATGCTATCAGGATTTACTCCAACCATGTCATCCATTGAA
2243344    ATGGTGGTTATAGATGTAAAAATGCTATCAGGATTTACTCCAACCATGTCATCCATTGAA
           ************************************************************

INSP087rbd GAGCTTGAAAACAAGGGCCAAGTGATGAAGACTGAAGTCAAGAATGACCATGTTCTTTTC
2243344    GAGCTTGAAAACAAGGGCCAAGTGATGAAGACTGAAGTCAAGAATGACCATGTTCTTTTC
           ************************************************************

INSP087rbd TACTTGGAAAATGGTTTTGGTCGAGCAGACAGTTTCCCTTTTTCTGTTGAGCAGAGCAAC
2243344    TACTTGGAAAATGGTTTTGGTCGAGCAGACAGTTTCCCTTTTTCTGTTGAGCAGAGCAAC
           ************************************************************

INSP087rbd CTTGTGTTCAACATTCAGCCAGCCCCAGCCATGGTCTACGATTATTACGAAAAAGAAGAA
2243344    CTTGTGTTCAACATTCAGCCAGCCCCAGCCATGGTCTACGATTACTATGAAAAAGAAGAA
           ******************************************** ** ************

INSP087rbd TATGCCCTAGCTTTTTACAACATCGACAGTAGTTCAGTTTCCCAG
2243344    TATGCCCTAGCTTTTTACAACATCGACAGTAGTTCAGTTTCCGAG
           ****************************************** **

b)

INSP087rbd MVVIDVKMLSGFTPTMSSIEELENKGQVMKTEVKNDHVLFYLENGFGRADSFPFSVEQSN
2243344    MVVIDVKMLSGFTPTMSSIEELENKGQVMKTEVKNDHVLFYLENGFGRADSFPFSVEQSN
           ************************************************************

INSP087rbd LVFNIQPAPAMVYDYYEKEEYALAFYNIDSSSVSQ
2243344    LVFNIQPAPAMVYDYYEKEEYALAFYNIDSSSVSE
           **********************************:
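
The asterisk lines under the nucleotide blocks of Figure 6 mark identical columns between the two sequences. A minimal Python sketch of that convention follows, assuming equal-length, gap-free blocks as in panel a); the function and constant names are illustrative, not from the source. It is applied to the last nucleotide block of panel a).

```python
def match_line(a: str, b: str) -> str:
    """Return '*' for identical columns and ' ' for mismatches."""
    if len(a) != len(b):
        raise ValueError("aligned blocks must have equal length")
    return "".join("*" if x == y else " " for x, y in zip(a, b))


def mismatches(a: str, b: str) -> list[tuple[int, str, str]]:
    """List (1-based column, base in a, base in b) for every mismatch."""
    return [(i + 1, x, y) for i, (x, y) in enumerate(zip(a, b)) if x != y]


# Last nucleotide block of Figure 6 a), copied from the figure.
INSP087_RBD_TAIL = "TATGCCCTAGCTTTTTACAACATCGACAGTAGTTCAGTTTCCCAG"
CLONE_2243344_TAIL = "TATGCCCTAGCTTTTTACAACATCGACAGTAGTTCAGTTTCCGAG"

print(match_line(INSP087_RBD_TAIL, CLONE_2243344_TAIL))
print(mismatches(INSP087_RBD_TAIL, CLONE_2243344_TAIL))  # [(43, 'C', 'G')]
```

The single difference falls in the final codon (CAG versus GAG), which agrees with the Q versus E difference at the last residue in panel b).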
Figure 7:

1 caggtttccc ttggcttctc cccctcccag cagcttccag gagcagaagt ggagctgcag
qvs lgf spsq qlp gae velq

61 ctgcaggcag ctcccggatc cctgtgtgcg ctccgggcgg tggatgagag tgtcttactg
lqa apg slca lra vde svll

121 cttaggccag acagagagct gagcaaccgc tctgtctatg ggatgtttcc attctggtat
lrp dre lsnr svy gmf pfwy

181 ggtcactacc cctatcaagt ggctgagtat gatcagtgtc cagtgtctgg cccatgggac 
ghy pyq vaey dqc pvs gpwd 

241 tttcctcagc ccctcattga cccaatgccc caagggcatt cgagccagcg ttccattatc 
fpq pli dpmp qgh ssq rsii 

301 tggaggccct cgttctctga aggcacggac cttttcagct ttttccggga cgtgggcctg
wrp sfs egtd lfs ffr dvgl

361 aaaatactgt ccaatgccaa aatcaagaag ccagtagatt gcagtcacag atctccagaa 
kil sna kikk pvd csh rspe 

421 tacagcactg ctatgggtgc aggcggtggt catccagagg cttttgagtc atcaactcct 
yst amg aggg hpe afe sstp 

481 ttacatcaag cagaggattc tcaggtccgc cagtacttcc cagagacctg gctctgggat 
lhq aed sqvr qyf pet wlwd 

541 ctgtttccta ttggtaactc ggggaaggag gcggtccacg tcacagttcc tgacgccatc 
lfp ign sgke avh vtv pdai 

601 accgagtgga aggcgatgag tttctgcact tcccagtcaa gaggcttcgg gctttcaccc
tew kam sfct sqs rgf glsp

661 actgttggac taactgcttt caagccgttc tttgttgacc tgactctccc ttactcagta
tvg lta fkpf fvd ltl pysv

721 gtccgtgggg aatcctttcg tcttactgcc accatcttca attacctaaa ggattgcatc 
vrg esf rlta tif nyl kdci 

781 agggttcaga ctgacctggc taaatcgcat gagtaccagc tagaatcatg ggcagattct 
rvq tdl aksh eyq les wads 

841 cagacctcca gttgtctctg tgctgatgac gcaaaaaccc accactggaa catcacagct 
qts scl cadd akt hhw nita 

901 gtcaaattgg gtcacattaa ctttactatt agtacaaaga ttctggacag caatgaacca 
vkl ghi nfti stk ild snep 

961 tgtgggggcc agaaggggtt tgttccccaa aagggccgaa gtgacacgct catcaagcca
cgg qkg fvpq kgr sdt likp

1021 gttctcgtca aacctgaggg agtcctggtg gagaagacac acagctcatt gctgtgccca
vlv kpe gvlv ekt hss llcp

1081 aaaggaaagg tggcatctga atctgtctcc ctggagctcc cagtggacat tgttcctgac
kgk vas esvs lel pvd ivpd

1141 tcgaccaagg cttatgttac ggttctggga gacattatgg gcacagccct gcagaacctg
stk ayv tvlg dim gta lqnl

1201 gatggtctgg tgcagatgcc cagtggctgt ggcgagcaga acatggtctt gtttgctccc 
dgl vqm psgc geq nmv lfap 

1261 atcatctatg tcttgcagta cctggagaag gcagggctgc tgacggagga gatcaggtct 
iiy vlq ylek agl lte eirs 

1321 cgggcagtgg gtttcctgga aatagggtac cagaaggagc tgatgtacaa acacagcaat 
rav gfl eigy q k e lmy khsn 

1381 ggctcataca gtgcctttgg ggagcgagat ggaaatggaa acacatggct gacagcgttt 
gsy saf gerd gng ntw ltaf 

1441 gtcacaaaat gctttggcca agctcagaaa ttcatcttca ttgatcccaa gaacatccag 
vtk cfg qaqk fif idp kniq 

1501 gatgctctca agtggatggc aggaaaccag ctccccagtg gctgctatgc caacgtggga
dal kwm agnq lps gcy anvg

1561 aatctccttc acacagctat gaagggtggt gttgatgatg aggtctcctt gactgcgtat
nll hta mkgg vdd evs ltay

1621 gtcacagctg cattgctgga gatgggaaag gatgtagatg acccaatggt gagtcagggt 
vta all emgk dvd dpm vsqg 

1681 ctacggtgtc tcaagaattc ggccacctcc acgaccaacc tctacacaca ggccctgttg
lrc lkn sats ttn lyt qall

1741 gcttacattt tctccctggc tggggaaatg gacatcagaa acattctcct taaacagtta
ayi fsl agem dir nil lkql

1801 gatcaacagg ctatcatctc aggagaatcc atttactgga gccagaaacc tactccatca
dqq aii sges iyw sqk ptps

1861 tcgaacgcca gcccttggtc tgagcctgcg gctgtagatg tggaactcac agcatatgca
sna spw sepa avd vel taya

1921 ttgttggccc agcttaccaa gcccagcctg actcaaaagg agatagcgaa ggccactagc
lla qlt kpsl tqk eia kats

1981 atagtggctt ggttggccaa gcaacacaat gcatatgggg gcttctcttc tactcaggat 
iva wla kqhn ayg gfs stqd 

2041 actgtagttg ctctccaagc tcttgccaaa tatgccacta ccgcctacat gccatctgag
tvv alq alak yat tay mpse

2101 gagatcaacc tggttgtaaa atccactgag aatttccagc gcacattcaa catacagtca
ein lvv kste nfq rtf niqs

2161 gttaacagat tggtatttca gcaggatacc ctgcccaatg tccctggaat gtacacgttg
vnr lvf qqdt lpn vpg mytl

2221 gaggcctcag gccagggctg tgtctatgtg cagacggtgt tgagatacaa tattctccct
eas gqg cvyv qtv lry nilp

2281 cccacaaata tgaagacctt tagtcttagt gtggaaatag gaaaagctag atgtgagcaa 
ptn mkt fsls vei gka rceq 
2341 ccgacttcac ctcgatcctt gactctcact attcacacca gttatgtggg gagccgtagc
pts prs ltlt iht syv gsrs

2401 tcttcc

ssn mai vevk mls gfs pmeg

2461

tnq lll qqpl vkk vef gtdt

2521

lni yld elik ntq tyt ftis

2581

qsv lvt nlkp ati kvy dyyl

2641 gtccagcgga
pde qat iqys dpc e

2701 gaaatgggtg gagttatggg ttagggtggc agaagttaag aggagcctct tttcgagtta
2761 ctgtcattgt ctttttttga gatagagtct cgcggtgttg cccaggctgg agggcagtgg
2821 cggaggtt

= INSP088 predicted receptor binding domain.



Figure 8:

a)

INSP088rbd AATATGGCTATTGTGGAAGTGAAGATGCTATCTGGGTTCAGTCCCATGGAGGGCACCAAT
4753534    AATATGGCTATTGTGGAAGTGAAGATGCTATCTGGGTTCAGTCCCATGGAGGGCACCAAT
           ************************************************************

INSP088rbd CAGTTACTTCTCCAGCAACCCCTGGTGAAGAAGGTTGAATTTGGAACTGACACACTTAAC
4753534    CAGTTACTTCTCCAGCAACCCCTGGTGAAGAAGGTTGAATTTGGAACTGACACACTTAAC
           ************************************************************

INSP088rbd ATTTACTTGGATGAGCTCATTAAGAACACTCAGACTTACACCTTCACCATCAGCCAAAGT
4753534    ATTTACTTGGATGAGCTCATTAAGAACACTCAGACTTACACCTTCACCATCAGCCAAAGT
           ************************************************************

INSP088rbd GTGCTGGTCACCAACTTGAAACCAGCAACCATCAAGGTCTATGACTACTACCTACCAGAT
4753534    GTGCTGGTCACCAACTTGAAACCAGCAACCATCAAGGTCTATGACTACTACCTACCAGAT
           ************************************************************

INSP088rbd GAAGAGGCAACAATTCAGTATTCTGATCCCTGTGAA
4753534    GAAGAGGCAACAATTCAGTATTCTGATCCCTGTGAA
           ************************************

b)

INSP088rbd NMAIVEVKMLSGFSPMEGTNQLLLQQPLVKKVEFGTDTLNIYLDELIKNTQTYTFTISQS
4753534    NMAIVEVKMLSGFSPMEGTNQLLLQQPLVKKVEFGTDTLNIYLDELIKNTQTYTFTISQS
           ************************************************************

INSP088rbd VLVTNLKPATIKVYDYYLPDEQATIQYSDPCE
4753534    VLVTNLKPATIKVYDYYLPDEQATIQYSDPCE
           ********************************
Figure 9:

Molecule: pENTR-INSP088-sigptd-6HIS, 2615 bps DNA Circular

File Name: pENTR-INSP088-sigptd-6HIS-Vlb.cm5

Type     Start   End         Name             Description
REGION   27      129         attL1
GENE     136     201         IL12p40          signal sequence
GENE     202     483         INSP088-6HIS     cds
REGION   487     588         attL2
MARKER   634             C   pENTR R primer
GENE     710     1519        KmR
REGION   1636    2309        pUC ori
MARKER   2612                pENTR F primer
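
The Start and End columns of the Figure 9 feature table allow simple consistency checks, such as feature lengths and whether the signal sequence runs directly into the coding sequence. The Python sketch below illustrates this; it assumes the coordinates are 1-based and inclusive, which the figure does not state, and the `Feature` class is a hypothetical helper rather than anything defined in the source. Only rows that list both a start and an end are included.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Feature:
    kind: str
    start: int
    end: int
    name: str

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


# Rows copied from the Figure 9 table (primer markers omitted: no End value).
FEATURES = [
    Feature("REGION", 27, 129, "attL1"),
    Feature("GENE", 136, 201, "IL12p40 signal sequence"),
    Feature("GENE", 202, 483, "INSP088-6HIS cds"),
    Feature("REGION", 487, 588, "attL2"),
    Feature("GENE", 710, 1519, "KmR"),
    Feature("REGION", 1636, 2309, "pUC ori"),
]

for f in FEATURES:
    print(f"{f.name:26s} {f.start:5d}-{f.end:<5d} {f.length:4d} bp")

signal, cds = FEATURES[1], FEATURES[2]
# The signal sequence ends at 201 and the cds starts at 202, so the two
# features are contiguous; both lengths are multiples of three.
contiguous = signal.end + 1 == cds.start
in_frame = signal.length % 3 == 0 and cds.length % 3 == 0
print(contiguous, in_frame)
```

Under the stated coordinate assumption the signal sequence spans 66 bp (22 codons) and the cds spans 282 bp (94 codons).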
Figure 10:

Molecule: pEAK12d-INSP088-sigptd-6HIS, 7297 bps DNA Circular

File Name: pEAK12d-INSP088-sigptd-6HIS-Vlb.cm5

Type     Start   End         Name             Description
REGION   2       595                          pmb-ori
GENE     596     1519        AmpR             Ampicillin resistance gene
REGION   1690    2795        EF-1alpha        promoter
MARKER   2703                                 pEAK12 F primer
REGION   2796    2845                         MCS'
REGION   2855    2874                         attB1
GENE     2888    2953        IL12p40 signal sequence
GENE     2954    3235        INSP088-6HIS     cds
         3243    3264
REGION   3270    3270                         'MCS
REGION   3271    3699                         poly A/splice
MARKER   3385            C                    pEAK12 R primer
GENE                     C                    Puromycin resistance gene
REGION   4542    4319    C   tK               tK promoter
REGION   5037    4543    C   Ori P
         7089    5037    C   EBNA-1
REGION           7289        sv40             Bsrrt
Figure 11:

1 caggtttccc ttggcttctc cccctcccag cagcttccag gagcagaagt ggagctgcag
qvs lgf spsq qlp gae velq

61 ctgcaggcag ctcccggatc cctgtgtgcg ctccgggcgg tggatgagag tgtcttactg
lqa apg slca lra vde svll

121 cttaggccag acagagagct gagcaaccgc tctgtctatg ggatgtttcc attctggtat
lrp dre lsnr svy gmf pfwy

181 ggtcactacc cctatcaagt ggctgagtat gatcagtgtc cagtgtctgg cccatgggac
ghy pyq vaey dqc pvs gpwd

241 tttcctcagc ccctcattga cccaatgccc caagggcatt cgagccagcg ttccattatc
fpq pli dpmp qgh ssq rsii


301 tggaggccct cgttctctga aggcacggac cttttcagct ttttccggga cgtgggcctg
wrp sfs egtd lfs ffr dvgl

361 aaaatactgt ccaatgccaa aatcaagaag ccagtagatt gcagtcacag atctccagaa
kil sna kikk pvd csh rspe

421 tacagcactg ctatgggtgc aggcggtggt catccagagg cttttgagtc atcaactcct
INSP088-CP3

yst amg aggg hpe afe sstp

481 ttacatcaag cagaggattc tcaggtccgc cagtacttcc cagagacctg gctctgggat
lhq aed sqvr qyf pet wlwd

541 ctgtttccta ttggtaactc ggggaaggag gcggtccacg tcacagttcc tgacgccatc
lfp ign sgke avh vtv pdai

601 accgagtgga aggcgatgag tttctgcact tcccagtcaa gaggcttcgg gctttcaccc
tew kam sfct sqs rgf glsp

661 actgttggac taactgcttt caagccgttc tttgttgacc tgactctccc ttactcagta
tvg lta fkpf fvd ltl pysv

721 gtccgtgggg aatcctttcg tcttactgcc accatcttca attacctaaa ggattgcatc
vrg esf rlta tif nyl kdci

781 agggttcaga ctgacctggc taaatcgcat gagtaccagc tagaatcatg ggcagattct
rvq tdl aksh eyq les wads

841 cagacctcca gttgtctctg tgctgatgac gcaaaaaccc accactggaa catcacagct
qts scl cadd akt hhw nita

901 gtcaaattgg gtcacattaa ctttactatt agtacaaaga ttctggacag caatgaacca
vkl ghi nfti stk ild snep

961 tgtgggggcc agaaggggtt tgttccccaa aagggccgaa gtgacacgct catcaagcca
cgg qkg fvpq kgr sdt likp

1021 gttctcgtca aacctgaggg agtcctggtg gagaagacac acagctcatt gctgtgccca
vlv kpe gvlv ekt hss llcp

1081 aaaggaaagg tggcatctga atctgtctcc ctggagctcc cagtggacat tgttcctgac
kgk vas esvs lel pvd ivpd



2341 ccgacttcac ctcgatcctt gactctcact attcacacca gttatgtggg gagccgtagc
INSP088-CP1
INSP088-CP2
2701 gaaatgggtg gagttatggg ttagggtggc agaagttaag aggagcctct tttcgagtta
2761 ctgtcattgt ctttttttga gatagagtct cgcggtgttg cccaggctgg agggcagtgg
2821 cggaggtt

INSP088 predicted receptor binding domain.
INSP088 predicted macroglobulin domain.
Position and sense of PCR primers.
Figure 12:

1 gtggtcatcc agaggctttt gagtcatcaa ctcctttaca tcaagcagag gattctcagg 
INSP088-CP3 

gh peaf ess tpl hqae dsq 

61 tccgccagta cttcccagag acctggctct gggatctgtt tcctattggt aactcgggga 
vrq yfpe twl wdl fpig nsg 

121 aggaggcggt ccacgtcaca gttcctgacg ccatcaccga gtggaaggcg atgagtttct 
kea vhvt vpd ait ewka msf 

181 gcacttccca gtcaagaggc ttcgggcttt cacccactgt tggactaact gctttcaagc
cts qsrg fgl spt vglt afk

241 cattctttgt tgacctgact ctcccttact cagtagtccg tggggaatcc tttcgtctta
pff vdlt lpy svv rges frl

301 ctgccaccat cttcaattac ctaaaggatt gcatcagggt tcagactgac ctggctaaat
tat ifny lkd cir vqtd lak

361 cgcatgagta ccagctagaa tcatgggcag attctcagac ctccagttgt ctctgtgctg
she yqle swa dsq tssc lca

421 atgaagcaaa aacccaccac tggaacatca cagctgtcaa attgggtcac attaacttta
dea kthh wni tav klgh inf

481 ctattagtac aaagattctg gacagcaatg aaccatgtgg gggccagaag gggtttgttc
tis tkil dsn epc ggqk gfv

541 cccaaaaggg ccgaagtgac acgctcatca agccagttct cgtcaaacct gagggagtcc
pqk grsd tli kpv lvkp egv

601 tggtggagaa gacacacagc tcattgctgt gcccaaaagg aaaggtggca tctgaatctg
lve kths sll cpk gkva ses

661 tctccctgga gctcccagtg gacattgttc ctgactcgac caaggcttat gttacggttc 
vsl elpv div pds tkay vtv 

721 tgggagacat tatgggcaca gccctgcaga acctggatgg tctggtgcag atgcccagtg 
lgd imgt alq nld glvq mps 

781 gctgtggcga gcagaacatg gtcttgtttg ctcccatcat ctatgtcttg cagtacctgg 
gcg eqnm vlf api iyvl qyl 

841 agaaggcagg gctgctgacg gaggagatca ggtctcgggc agtgggtttc ctggaaatag
eka gllt eei rsr avgf lei

901 ggtaccagaa ggagctgatg tacaaacaca gcaatggctc atacagtgcc tttggggagc
gyq kelm ykh sng sysa fge

961 gagatggaaa tggaaacaca tggctgacag cgtttgtcac aaaatgcttt ggccaagctc
rdg ngnt wlt afv tkcf gqa

1021 agaaattcat cttcattgat cccaagaaca tccaggatgc tctcaagtgg atggcaggaa
qkf ifid pkn iqd alkw mag

1081 accagctccc cagtggctgc tatgccaacg tgggaaatct ccttcacaca gctatgaagg
nql psgc yan vgn llht amk

1141 gtggtgttga tgatgaggtc tccttgactg cgtatgtcac agctgcattg ctggagatgg
ggv ddev slt ayv taal lem

1201 gaaaggatgt agatgaccca atggtgagtc agggtctatg gtgtctcaag aattcggcca 
gkd vddp mvs qgl wclk nsa 

1261 cctccacgac caacctctac acacaggccc tgttggctta cattttctcc ctggctgggg
tst tnly tqa lla yifs lag

1321 aaatggacat cagaaacatt ctccttaaac agttagatca acaggctatc atctcaggag
emd irni llk qld qqai isg

1381 aatccattta ctggagccag aaacctactc catcatcgaa cgccagccct tggtctgagc
esi ywsq kpt pss nasp wse

1441 ctgcggctgt agatgtggaa ctcacagcat atgcattgtt ggcccagctt accaagccca
paa vdve lta yal laql tkp

1501 gcctgactca aaaggagata gcgaaggcca ctagcatagt ggcttggttg gccaagcaac
slt qkei aka tsi vawl akq

1561 gcaatgcata tgggggcttc tcttctactc aggatactgt agttgctctc caagctcttg 
rna yggf sst qdt vval qal 

1621 ccaaatatgc cactaccgcc tacgtgccat ctgaggagat caacctggtt gtaaaatcca 
aky atta yvp see inlv vks 

1681 ctgagaattt ccagcgcaca ttcaacatac agtcagttaa cagattggta tttcagcagg 
ten fqrt fni qsv nrlv fqq 

1741 ataccctgcc caatgtccct ggaatgtaca cgttggaggc ctcaggccag ggctgtgtct
dtl pnvp gmy tle asgq gcv

1801 atgtgcagac ggtgttgaga tacaatattc tccctcccac aaatatgaag acctttagtc
yvq tvlr yni lpp tnmk tfs

1861 ttagtgtgga aataggaaaa gctagatgtg agcaaccgac ttcacctcga tccttgactc
lsv eigk arc eqp tspr slt

1921 tcactattca caccagttat gtggggagcc gtagctcttc caatatggct attgtggaag 
lti htsy vgs rss snma ive 

1981 tgaagatgct atctgggttc agtcccatgg agggcaccaa tcagttactt ctccagcaac 
vkm lsgf spm egt nqll lqq 

2041 ccctggtgaa gaaggttgaa tttggaactg acacacttaa catttacttg gatgagctca 
plv kkve fgt dtl niyl del 

2101 ttaagaacac tcagacttac accttcacca tcagccaaag tgtgctggtc accaacttga 
ikn tqty tft isq svlv tnl 

2161 aaccagcaac catcaaggtc tatgactact acctaccaga tgaacaggca acaattcagt
kpa tikv ydy ylp deqa tiq

2221 attctgatcc ctgtgaatga ggtaagtgcc agc

INSP088-CP2

ysd pce

Position and sense of PCR primers ►




Figure 13: 



Molecule: pCR4-TOPO-INSP088- 